Categorization of Narrative Semantics for Use in Generative Multidocument Summarization

نویسنده

  • David K. Elson
چکیده

The generative summarization of textual stories has been one of the goals of computational narratology since attempts at full semantic NLU in the ’70s. Our NLP group has recently created several systems for multidocument news summarization using purely statistical methods. Between these poles, there may be an unexplored avenue where knowledge of story structure can give partial, yet useful semantic understanding to a news reader. Such knowledge can then lead to summaries more informed than those based on solely statistical means. This student paper represents work in progress on a two-module system: The first module categorizes news articles into their underlying dramatic structures; the second will attempt to use this understanding to create and execute a generative plan, concisely retelling the story to form a surface-level summary.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using N-Grams To Understand the Nature of Summaries

Although single-document summarization is a well-studied task, the nature of multidocument summarization is only beginning to be studied in detail. While close attention has been paid to what technologies are necessary when moving from single to multi-document summarization, the properties of humanwritten multi-document summaries have not been quantified. In this paper, we empirically character...

متن کامل

Experiments with CST-Based Multidocument Summarization

Recently, with the huge amount of growing information in the web and the little available time to read and process all this information, automatic summaries have become very important resources. In this work, we evaluate deep content selection methods for multidocument summarization based on the CST model (Cross-document Structure Theory). Our methods consider summarization preferences and focu...

متن کامل

Dynamic Categorization of Semantics of Fashion Language: A Memetic Approach

Categories are not invariant. This paper attempts to explore the dynamic nature of semantic category, in particular, that of fashion language, based on the cognitive theory of Dawkins’ memetics, a new theory of cultural evolution. Semantic attributes of linguistic memes decrease or proliferate in replication and spreading, which involves a dynamic development of semantic category. More specific...

متن کامل

An Integrated Multi-document Summarization Approach based on Word Hierarchical Representation

This paper introduces a novel hierarchical summarization approach for automatic multidocument summarization. By creating a hierarchical representation of the words in the input document set, the proposed approach is able to incorporate various objectives of multidocument summarization through an integrated framework. The evaluation is conducted on the DUC 2007 data set.

متن کامل

Machine and Human Performance for Single and Multidocument Summarization

coherency—and be able to draw the “best” information from a set of documents. Automatic single-document text summarization1 has been an active research area since the 1950s, with a renaissance of approaches since the 1990s. Human single-document summarization is well defined when guidelines and recommendations drive performance.2,3 System-generated single-document summaries, while not always ma...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004